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Abstract 

Transport in weighted networks is dominated by the minimum spanning tree (MST), the tree 
connecting all nodes with the minimum total weight. We find that the MST can be partitioned 
into two distinct components, having significantly different transport properties, characterized by 
centrality — number of times a node (or link) is used by transport paths. One component, the 
superhighways, is the infinite incipient percolation cluster; for which we find that nodes (or links) 
with high centrality dominate. For the other component, roads, which includes the remaining 
nodes, low centrality nodes dominate. We find also that the distribution of the centrality for the 
infinite incipient percolation cluster satisfies a power law, with an exponent smaller than that for 
the entire MST. The significance of this finding by showing that one can improve significantly the 
global transport by improving a very small fraction of the network, the superhighways. 
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Recently much attention has been focused on the topic of complex networks, which char- 
acterize many natural and man-made systems, such as the internet, airline transport system, 
power grid infrastructures, and the world wide web 

BBS 

. Besides the static properties 
of complex networks, dynamical phenomena such as transport in networks are of vital im- 
portance from both theoretical and practical perspectives. Recently much effort has been 
focused on weighted networks , where each link or node is associated with a weight. 
Weighted networks yield a more realistic description of real networks. For example, the 
cable links between computers in the internet network have different weights, representing 
their capacities or bandwidths. 

In weighted networks the minimum spanning tree (MST) is a tree including all of the 
nodes but only a subset of the links, which has the minimum total weight out of all possible 
trees that span the entire network. Also, the MST is the union of all "strong disorder" 
optimal paths between any two nodes (J [7, 0, 0, r], 11, [l^. The MST which plays a 
major role for transport is widely used in different fields, such as the design and operation of 
communication networks, the traveling salesman problem, the protein interaction problem, 
optimal traffic flow, and economic networks 

An important quantity that characterizes transport in networks is the betweenness cen- 
trality, C, which is the number of times a node (or link) used by the set of all shortest 



paths between all pairs of nodes 
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2l| . For simplicity we call the "betweenness cen- 



trality" here "centrality" and we use the notation "nodes" but similar results have been 
obtained for links. The centrality, C, quantifies the "importance" of a node for transport 
in the network. Moreover, identifying the nodes with high C enables, as shown below, to 



'he 
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improve their transport capacity and thus improve the global transport in the network, 
probability density function (pdf ) of C was studied on the MST for both scale-free (SF) 
and Erdos-Renyi (ER) [23[ networks and found to satisfy a power law, 

Pmst(C) ~ C- 5 ^ (1) 

with <5 MS T close to 2 0,Q- 

Here we show that a sub- network of the MST [^J , the infinite incipient percolation cluster 
(IIC) has a significantly higher average C than the entire MST — i.e., the set of nodes inside 
the IIC are typically used by transport paths more often than other nodes in the MST. - 
In this sense the IIC can be viewed as a set of superhighways (SHW) in the MST. The nodes 
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on the MST which are not in the IIC are called roads, due to their analogy with roads which 
are not superhighways (usually used by local residents). We demonstrate the impact of this 
finding by showing that improving the capacity of the superhighways (IIC) is surprisingly 
a better strategy to enhance global transport compared to improving the same number of 
links of the highest C in the MST, although they have higher C |26[. This counterintuitive 
result shows the advantage of identifying the IIC subsystem, which is very small compared 
to the full network [27]. Our results are based on extensive numerical studies for centrality 
of the IIC, and comparison with the centrality of the entire MST. We study ER, SF and 
square lattice networks. 

To generate a ER network of size N with average degree (k), we pick at random a pair of 
nodes from all possible iV(iV — l)/2 pairs, link this pair, and continue this process until we 
have exactly (k)N/2 edges. We disallow multiple connections between two nodes and self- 
loops in a single node. To construct SF networks with a prescribed pow er law distribution 
V(k) ~ k~ x with k > k min Q, we use the Molloy-Reed algorithm [3, We assign to 
each node i a random number fc, of links drawn from this power law distribution. Then we 
choose a node % and connect each of its k{ links with randomly selected k{ different nodes. 

To construct a weighted network, we next assign a weight Wi to each link from a uniform 
distribution between and 1. The MST is obtained from the weighted network using Prim's 
algorithm [3^]. We start from any node in the largest connected component of the network 
and grow a tree-like cluster to the nearest neighbor with the minimum weight until the 
MST includes all the nodes of the largest connected component. Once the MST is built, we 
compute the value of C of each node by counting the number of paths between all possible 
pairs passing through that node . We normalize C by the total number of pairs in the MST, 
iV(iV — l)/2, which ensures that C is between and 1 31|. 

To find the IIC of ER and SF networks, we start with the fully connected network and 
remove links in descending order of their weights. After each removal of a link, we calculate 
k = (k 2 ) / (k), which decreases with link removals. When k < 2, we stop the process because 
at this point, the largest remaining component is the IIC 32]. For the two dimensional 
(2D) square lattice we cut the links in descending order of their weights until we reach the 
percolation threshold, p c (= 0.5). At that point the largest remaining component is the 

IIC 0. 

To quantitatively study the centrality of the nodes in the IIC, we calculate the pdf, 
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Vnc(C) of C. In Fig. U we show for nodes that for all three cases studied, ER, SF and 
square lattice networks, Vnc(C) satisfies a power law 

Puc(C)~C- 5 ™, (2) 

where 

f 1.2 [ER, SF] 

fco« (3) 
I 1.25 [square lattice] 

Moreover, from Fig. [TJ we find that Sue < <5mst, implying a larger probability to find a 
larger value of C in the IIC compared to the entire MST. Our values for 5mst are consistent 
with those found in Ref. j^J. We obtain similar results for the centrality of the links. Our 
results thus show that the IIC is like a network of superhighways inside the MST. When we 
analyze centrality for the entire MST, the effect of the high C of the IIC is not seen since 
the IIC is only a small fraction of the MST. Our results are summarized in Table 

To further demonstrate the significance of the IIC, we compute for each realization of 
the network the average C over all nodes, (C). In Fig. we show the histograms of (C) for 
both the IIC and for the other nodes on the MST. We see that the nodes on the IIC have a 
much larger (C) than the other nodes of the MST. 

Figure El shows a schematic plot of the SHW inside the MST and demonstrates its use by 
the path between pairs of nodes. The MST is the "skeleton" inside the network, which plays 
a key role in transport between the nodes. However, the IIC in the MST is like the "spine in 
the skeleton" , which plays the role of the superhighways inside a road transportation system. 
A car can drive from the entry node A on roads until it reaches a superhighway, and finds 
the exit which is closest to the exit node B. Thus those nodes which are far from each other 
in the MST should use the IIC superhighways more than those nodes which are close to 
each other. In order to demonstrate this, we compute /, the average fraction of pairs of 
nodes using the IIC, as a function of ^mst 5 the distance between a pair of nodes on the MST 
(Fig. Hj). We see that / increases and approaches one as £mst grows. We also show that / 
scales as Im stA /V^ for different system sizes, where z/ opt is the percolation connectedness 
exponent [flL llfll]. 

The next question is how much the IIC is used in transport on the MST? We define the 
IIC superhighway usage, 

u = - f , (4) 
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where 4ic is the number of the links in a given path of length £mst belonging to the IIC 
superhighways. The average usage (u) quantifies how much the IIC is used by the transport 
between all pairs of nodes. In Fig. EJa), we show (u) as a function of the system size N. 
Our results suggest that (u) approaches a constant value and becomes independent of N 
for large N. This is surprising since the average value of the ratio between the number of 
nodes on the IIC and on the MST, (Nuc/Nust), approaches zero as N — > oo (27 1, showing 
that although the IIC contains only a tiny fraction of the nodes in the entire network, its 
usage for the transport in the entire network is constant. We find that (u) ~ 0.3 for ER 
networks, (u) ~ 0.2 for SF networks with A = 4.5, and (u) ~ 0.64 for the square lattice. 
The reason why (u) is not close to 1.0 is that in addition to the IIC, the optimal path 
passes through other percolation clusters, such as the second largest and the third largest 
percolation clusters. In Fig. El we also show for ER networks, the average usage of the two 
largest and the three largest percolation clusters for a path on the MST and we see that the 
average usage increases significantly and is also independent of N. However, the number of 
clusters used by a path on MST is relatively small and proportional to In N [34] , suggesting 
that the path on the MST uses only a few percolation clusters and a few jumps between 
them (~ IniV) in order to get from an entry node to an exit node on the network. When 
iV — > 00 the average usage of all percolation clusters should approach 1. 

Can we use the above results to improve the transport in networks? It is clear that by 
improving the capacity or conductivity of the highest C links one can improve the transport 
(see Fig. Efb) inset). We hypothesize that improving the IIC links (strategy I), which 
represent the superhighways is more effective than improving the same number of links with 
the highest C in the MST (strategy II), although they have higher centrality 26]. To test the 
hypothesis, we study two transport problems: (i) current flow in random resistor networks, 
where each link of the network represents a resistor and (ii) the maximum flow problem well 

. We assign to each link of the network a resistance/capacity, 
e ax , where x is an uniform random number between and 1, with a = 40. The value 
of a is chosen such as to have a broad distribution of disorder so that the MST carries 
most of the flow 10, [yj. We randomly choose n pairs of nodes as sources and other n 
nodes as sinks and compute flow between them. We compare the transport by improving 
the conductance/capacity of the links on the IIC (strategy I) with that by improving the 
same number of links but those with the highest C in the MST (strategy II). Since the two 
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sets are not the same and therefore higher centrality links will be improved in II 
is tempting to suggest that the better strategy to improve global flow would be strategy 
II. However, here we demonstrate using ER networks as an example that counterintuitively 
strategy I is better. We also find similar improvements of strategy I compared to strategy 
II for SF networks with A = 3.5. In Fig. GJb), we compute the ratio between the flow using 
strategy I (F s i) and the flow using strategy II (F s u) as a function of the factor of improving 
conductivity/capacity of the links. The figure clearly shows that strategy I is better than 
strategy II. Since the number of links in the IIC is relatively very small comparing to the 
number of links in the whole network |22( , it could be a very efficient strategy. 

In summary, we find that the centrality of the IIC for transport in networks is significantly 
larger than the centrality of the other nodes in the MST. Thus the IIC is a key component 
for transport in the MST. We demonstrate that improving the capacity/conductance of the 
links in the IIC is useful strategy to improve transport. 

We thank ONR, Israel Science Foundation, European NEST project DYSONET, FON- 
CyT (PICT-O2004/370) and Israeli Center for Complexity Science for financial support. 
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ER 


SF (A = 4.5) 


SF (A = 3.5) 


square lattice 


Sue 


1.2 


1.2 


1.2 


1.25 


<5mst 


1.6 


1.7 


1.7 


1.32 


Vopt 


1/3 


1/3 


0.2 


0.61 


(u) 


0.29 


0.20 


0.13 


0.64 



TABLE I: Results for the IIC and the MST 
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(a) ER Networks 
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FIG. 1: The pdf of the centrality of nodes for (a) ER graph with (k) = 4, (b) SF with A = 4.5, 
(c) SF with A = 3.5 and (d) 90 x 90 square lattice. For ER and SF, N = 8192 and for the square 
lattice N = 8100 . We analyze 10 4 realizations. For each graph, the filled circles show Vnc(C); 
the unfilled circles show 'Pmst(C). 
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FIG. 2: The normalized pdf for superhighway and roads of (C), the C averaged over all nodes in 
one realization, (a) ER network, (b) SF network with A = 4.5, (c) SF network with A = 3.5 and 
(d) square lattice network. To make each histogram, we analyze 1000 network configurations. 
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FIG. 3: Schematic graph of the network of connected superhighways (heavy lines) inside the MST 
(shaded). A, B and C are examples of possible entry and exit nodes, which connect to the network 
of superhighways by "roads" (thin lines). The middle size lines indicates other percolation clusters 
with much smaller size compared to the IIC. 
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(a) ER Networks 
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(b) SF Networks 
X=4.5 



10 

/ /NVt 



15 



20 



Q L ^EafflB O ED gum 



(d) LxL square lattice 
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FIG. 4: The average fraction, (/), of pairs using the SHW, as a function of ^msTj the distance 
on the MST. (a) ER graph with (k) = 4, (b) SF with A = 4.5, (c) SF with A = 3.5 and (d) 
square lattice. For ER and SF: (O)N = 1024 and (n)N = 2048 with 10 4 realizations. For 
square lattice: (O)^ = 1024 and (a)N = 2500 with 10 3 realizat ions. The x axis is rescaled 
by A^°p% where i> ovt = 1/3 for ER and for SF with A > 4, and u opt = (A - 3)/(A - 1) for SF 
networks with 3 < A < 4 j|. For the LxL square lattice, £mst ~ L dopt and since L 2 = N, 
v op t = d opt /2 « 0.61 



13 



0.8 



A 


0.6 


13 




V 




CD 




O) 




CO" 




U) 
13 


0.4 


CD 




O) 




CO" 

1 




CD 










0.2 




0.0 



10 



10' 



AAaAAAa (a) 




A 

- sC 

V 

A_ 

■^L 10° 

^ w 10" 10' 10' 10 
■ v factor of improving 



(b) 



5000 10000 
number of nodes N 



15000 1 10 ° 



9 ♦ 
t I " 



O 



10 1 10' 

factor of improving 



O 



FIG. 5: (a) The average usage (u) = (^iic/^mst) for different networks, as a function of the number 
of nodes N. O (ER with (k) = 4), □ (SF with A = 4.5), O (SF with A = 3.5), A (L x L square 
lattice). The symbols (>) and (<) represent the average usage for ER with (k) = 4 when the 
two largest percolation clusters and the three largest percolation clusters are taken into account, 
respectively, (b) The ratio between the flow using strategy I, F s i, and that using strategy II, F s n, 
as a function of the factor of improving conductivity /capacity. The inset is the ratio between the 
flow using strategy I and the flow in the original network, Fq. The data are all for ER networks 
with N = 2048, (k) = 4 and n = 50(O), " = 250(0) and n = 500(D). The unfilled symbols are 
for current flow and the filled symbols are for maximum flow. 
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